2023-10-09 16:21:27.AIbase.1.9k
Microsoft's Progress in Training Mini Language Models is Remarkable
Researcher Ellie Pavlick states that training mini language models is akin to sequencing the fruit fly genome rather than the human genome. Microsoft researchers have achieved significant results using children's stories to train mini language models. Compared to large language models, mini language models train faster and have mechanisms that are easier to understand. This approach helps in training larger models and also aids in analyzing their behavior. Research findings suggest that after being trained with children's stories, mini language models can narrate coherent and grammatically correct stories.